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Detection of galaxy assembly bias 
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ABSTRACT 

Assembly bias describes the finding that the clustering of dark matter haloes depends 
on halo formation time at fixed halo mass. In this paper, we analyse the influence 
of assembly bias on galaxy clustering using both semi-analytical models (SAMs) and 
observational data. At fixed stellar mass, SAMs predict that the clustering of central 
galaxies depends on the specific star formation rate (sSFR) , with more passive galax- 
ies having a higher clustering amplitude. We find similar trends using SDSS group 
catalogues, and verify that these are not affected by possible biases due to the group 
finding algorithm. Low mass central galaxies reside in narrow bins of halo mass, so 
the observed trends of higher clustering amplitude for galaxies with lower sSFR is 
not driven by variations of the parent halo mass. We argue that the clustering de- 
pendence on sSFR represent a direct detection of assembly bias. In addition, contrary 
to what expected based on clustering of dark matter haloes, we find that low-mass 
central galaxies in SAMs with larger host halo mass have a lower clustering amplitude 
than their counter-parts residing in lower mass haloes. This results from the fact that, 
at fixed stellar mass, assembly bias has a stronger influence on clustering than the 
dependence on the parent halo mass. 
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1 INTRODUCTION 

Empirical models that link galaxy properties statisti- 
cally with their hosting dark matter (sub)halo masses 
include the Halo Occupat i on D i stribution ( HOD) 
method jBerlind fe Weinberg! 120021 : lYang et all 120051 : 



I Wang et all 120061). an d the abundance matching metho d 
jVale fc Ostrikerll2006l : iGuo et alJlioici ; iMoster et ai1l2010l) . 

The HOD method places galaxies into dark matter haloes 
by modeling the number and the spatial distribution of 
galaxies in haloes of given mass. The abundance matching 
method assumes a one-to-one correspondence between 
galaxies and subhaloes, with higher mass galaxies residing 
in subhaloes of higher mass at the time of accretion. Both 
methods assume that galaxy properties depend only on 
halo mass, and are independent of any other properties of 
the parent halo. This assumption, which also applies to 
the excursion set (Extended Press-Schechter) formalism 
ijBond et al.lll99ll ). however, is challe nged by th e disco very 
of the assembly bias effect found by iGao et all (|2005f ) for 
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haloes less massive than about 10 13 h -1 Mq. Assembly bias 
indicates the finding that at the same halo mass, earlier 
assembled haloes cluster more than the ones that assemble 
late. Evidence that galaxy properties might depend on other 
properties than halo mass has also been fou n d in different 
galax y formation models (|Zhu et al.l 120061 : ICroton et al.l 
l2007t) . 



From an observational point of view, lYang et al.l (|2006l ) 
found that the clustering of galaxy groups of fixed halo 
masses depends on the star formation rate of the central 
galaxies which are located at the centres of dark mat- 
ter haloes. This is probably the first claim of observa- 
tional detection of halo assembly bias. Along this line, 
subsequent studies have also searched for a correlation 
between gal axy propert i es an d halo prope r ties a t fixe d 
halo mass dWang et al l |200S| ; iTinker et all l201ll . 120121 1. 
iKauffrnann et al.l Q2012J) have found interesting evidence 
that central galaxy properties are correlated with the prop- 
erties of their neighbours beyond t he virial radius of the ir 
parent haloes ( "galactic conformity" IWeinmann et al ]|2006t ). 
The existence of assembly bias in the real Universe is, how- 
ever, still debated (also because of uncertainties in esti- 
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Figure 1. In the DLB07 (upper two panels) and the Guoll (lower two panels) models, the halo formation rcdshift - M; n f a n relation and 
the fractional distribution (in unit of per Log(M; n f a n) dex) in M; n f a n in different stellar mass bins, for central (blue lines) and satellite 
(red lines) galaxies. Error bars indicate the 68 percentile distributions. 



mates of halo masses), and it remains unclear if (and to 
what extent) assembly bias influences the observed proper- 
ties of galaxies. In this paper, we analyse the dependence of 
galaxy clustering on different galaxy and halo properties in 
both semi-analytic galaxy formation models (SAMs) and in 
a group catalogue based on the SDSS DR7. Our main result 
is that central galaxies with lower specific star formation 
rate cluster more than the ones with more active star for- 
mation at fixed stellar mass, both in SAM and observations. 
As we argue below, this may provide evidence for assembly 
bias in the real Universe. 

The outline of our paper is as follows. In Section 2, we 
show the dependence of galaxy clustering on halo mass, halo 
formation time, and specific star formation rate of galaxies 
for two SAMs. In Section 3, observational correlation func- 
tions for central galaxies split by specific star formation rate 
and halo mass are calculated using group catalogues, and are 
compared with the predictions from mock galaxy catalogues 
from the two SAMs we analysed. Conclusions and discus- 
sions are presented in Section 4. All model results shown 
below are based on dark matter halo t rees from the Mil- 
lennium Simulation (|Springel et alj|2005h . The resolution of 
the simulation gives a lower limit in the dark matter halo 
mass of around 10 11 h~ x Mq, and a lower limit in the galaxy 
stellar mass of around 6 x 10 9 M Q . 



2 EFFECT OF ASSEMBLY BIAS IN 
SEMI-ANALYTIC MODELS 

In our previous study (|Wang et al.l 120131 ). we have shown 
that the stellar ma sses of galaxies extracted f rom the semi- 
analytic models of |Pe Lucia & Blaizot, (|2007l . DLB07) and 
iGuo et al] (|201ll . Guoll) depend not only on halo mass but 
also on halo formation time. In this section, we investigate 
further the correlation between galaxy properties and those 
of their parent haloes, with the aim of identifying likely indi- 
cations of assembly bias. In the following, Mhrfaii is the mass 



of host halo when galaxy is/was last time a central object 
of its hosting FOF group. For centrals, Mi n f a n=Mh a io- 

Fig. Q] shows the relation between the formation time 
of host haloes (defined as the redshift when half of Minf a ii 
was first assembled into a single object) and Mi n f a n in two 
SAMs. For a satellite galaxy, we trace back its main pro- 
genitors to the time when it was for the last time a cen- 
tral galaxy, and record the mass of its parent FOF group 
at this time as Mj n f a n. To define the corresponding forma- 
tion time, we then follow the main progenitor of the halo 
identified at the infall time, up to the time when its mass 
first reaches half of Mi n f a n. At all stellar masses, less mas- 
sive haloes assemble much earlier than more massive ones. 
We also show in Fig. [T] the distribution of Afi n f a n for these 
galaxies. At low stellar masses, the distributions are quite 
narrow. We note that the bias varies very weakly as a func- 
tion of halo mass for haloes w ith Mj n f a n < 10 12 ' 5 /i -1 Mq (e.g. 
ISheth. Mo. fc Tormedl200ll) : for the two lowest stellar mass 
bins considered, the maximum variation for the bias is at 
the level of 10— 15%. Therefore, for low stellar masses, large 
variations in the clustering as a function of galaxy physical 
properties cannot be attributed to variations of halo masses. 
For massive galaxies, on the other hand, the distributions of 
halo masses are rather extended and for haloes more massive 
than ~ 10 12 ' 5 Mq the bias increases rapidly as a function 
of halo mass. Therefore, for massive galaxies, one expects a 
stronger clustering variation due to a wider range of host 
halo masses. 

In Fig. [2] we check the dependence of galaxy cluster- 
ing on halo formation time, sSFR, and Mi n f aU at fixed stel- 
lar mass in the models of DLB07 and Guoll in two stel- 
lar mass bins. For each stellar mass bin, galaxies are split 
into different sub-samples according to the median value of 
each physical parameter considered. We show results both 
for the full population of galaxies, and for centrals and satel- 
lites separately. The top two panels show that galaxies re- 
siding in earlier formed haloes cluster more strongly than 
their counterparts residing in haloes that formed later, which 
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Figure 2. Correlation functions (CFs) of galaxies split by halo formation time, sSFR and M; n f a n in two stellar mass bins of 
log(M s tar/M Q )=[9.77,10.27] and [10.77,11.27] in the two SAMs. For each stellar mass bin, results are shown for all galaxies in the 
mass bin, as well as for central and satellite galaxies separately. In each panel, a black line is used to show the CF for all galaxies in the 
corresponding mass bin, while coloured lines correspond to sub-samples with values of the considered physical property smaller or larger 
than the median. 



reflects the halo assemb ly bias (|Gao et al.ll2005l ;l Zhu et al.l 
l200fj ; |Croton et al.ll2007l '). 

When split by sSFR, we find that galaxies with lower 
sSFR cluster more. This is consistent with observations that 
old and red galaxies cluster more even at fixed stellar masses 
|Li et al.ll200r3 ; iBamford et"ai]|2009l ; IWeinmann et a,lj|201ll ) 
and is mainly due to the higher clustering of satellites vs. 
centrals. However, we find that the result is still present 
when considering only central galaxies, or only satellite 
galaxies at scales smaller than l/i~ 1 Mpc in the Guoll 
model. In both SAMs, for the small stellar mass bin con- 
sidered, the clustering of central galaxies depends on their 
sSFR. We will show later that this trend is not driven by 
trends as a function of parent halo mass. 

In Fig. [3] we show the relation between galaxy sSFR 
and galaxy assembly time, defined as the time when half 
of the galaxy stellar mass of the present day is assembled 
in a single progenitor of the galaxy. In general, for galaxies 
with stellar mass lower than 1O 1O ' 77 M0, galaxies with lower 



sSFR assemble earlier. Combining this result with the find- 
ing that low-mass central galaxies with lower sSFR cluster 
more, we find that the clustering of low-mass central galax- 
ies in SAMs depends on the galaxy assembly time, i.e. there 
is an assembly bias. 

The result that galaxies with low sSFR cluster more 
than their counter-parts with higher sSFR could be due to 
a dependence of galaxy clustering on halo mass: the clus- 
tering amplitude increases with increasing halo mass, so if 
galaxies with lower sSFR are sitting in more massive haloes, 
this would explain the trends found. However, this is not 
the case for low mass galaxies. In the bottom two rows of 
Fig. [2] we show the galaxy correlation functions (CFs) split 
by Mjnfaii. For low mass galaxies, those with smaller Marfan 
actually cluster more strongly in both SAMs, for both cen- 
trals and satellites. This can be understood as follows: as 
shown in Fig. [T] at fixed stellar mass, less massive haloes 
form earlier. And earlier formed haloes cluster more due to 
halo assembly bias. In addition, the bias of the host haloes of 
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low mass galaxies varies little as mentioned before, which re- 
sults into little variations of galaxy clustering amplitude due 
to varying host halo mass. Therefore, for the lower stellar 
mass bin considered, the dependence of clustering on halo 
formation time has a larger influence than the dependence of 
clustering on halo mass. For the higher stellar mass bin, the 
dependencies on halo mass and halo formation time com- 
pensate, and CFs of galaxies with different Afi n f a n are quite 
similar, with only slight differences for satellites on small 
scales. 



3 ASSEMBLY BIAS IN OBSERVATION 

Can we find assembly bias as seen in the SAMs also in re- 
ality? While it is impossible to measure the halo formation 
time for observational results, we can check galaxy cluster- 
ing as a function of halo mass and sSFR, which could be 
affected by assembly bias, as shown in Section 2. 

For satellite galaxies, besides the possible effect of as- 
sembly bias on their clustering property, there are other 
complications due to the fact that they can be accreted over 
a range of cosmic epochs. Therefore, in the following, we 
focus only on central galaxies. 

3.1 SDSS results 

We make use of t he galaxy (|Blanton et alj|2005f ) and group 
(|Yang et al.ll2007l ) catalogues extracted from the SDSS DR7 
data, and use these catalogues to distinguish between central 
and satellite galaxies. For central galaxies within different 
stellar mass bins, we split galaxies into two subsamples with 
equal numbers of galaxies, according to the median value of 
sSFR and Mhaio respectively in this mass bin, and calculate 
the projected CFs for each subsample. The details of how 
to get the projected 2P CFs of SDSS DR 7 galaxies can be 
found in Appendix A of I Yang et ail (|2012l V 

In Fig. U CFs of subsamples in observation are shown 
for four different stellar mass bins. At low stellar masses, 
central galaxies with lower sSFR cluster more than those 
with higher sSFR, which is consistent with the trends found 
in the SAM. As argued in Section 2, this trend possibly indi- 
cates a signature of galaxy assembly bias in the data. Fig. [4] 
also shows that there is no significant difference between 
the clustering properties of sub-samples split by Mhaio in 
the group catalogues. Note, however, that in the SDSS ob- 
servations (as well as in the mock catalogues that we will 
use in the next subsection), halo masses are estimated from 
a simple ranking of the group stellar masses. In this way, 
splitting the sample as a function of halo mass just reflects 
trends as a function of stellar mass. Any possible formation 
time information associated with the mass of haloes in the 
real Universe will be erased due to the halo mass measure- 
ment adopted in the data. 

3.2 Comparison between SDSS and SAM mock 
catalogue 

To ensure a fair comparison with SAMs, we build mock cata- 
logues for both DLB07 and Guoll models, to mimic volumes 
and apparent magnitude limits of the observational data and 
to take into account the observational selection effects. The 



mock redshift catalo gues are construc ted in a way similar 
to that described in lYang et alj (|2004l ). where the detailed 
sky coverage of the SDSS DR7 including the angular varia- 
tions in the magnitude limit s and complete ness of the data is 
taken into account (see e.g. lLi et al.ll2007T l. Then, we adopt 
the same method used for the SDSS galaxies to define central 
and satellite galaxies, and to compute their CFs. The results 
from these models are also shown in Fig. [4] Similar to what 
shown in Fig. f2] for SAMs, central galaxies with lower sSFR 
cluster more than the ones with higher sSFR. The trend is 
consistent with SDSS measurements. Note that for central 
galaxies, the ratios between subsamples split in sSFR in the 
Guoll model are closer to the observational data, with re- 
spect to predictions from the DLB07 model. When satellite 
galaxies are also taken into account, however, the CFs in the 
Guoll model over-pre dicts the observational measuremen ts 
for low-mass galaxies (|Guo et al.ll201ll ; IWang et al.ll2013h . 

Comparing SAM mock catalogues and the results as 
presented in Fig. f2] we find some discrepancies. At small 
scales, the clustering in mock catalogues is somewhat en- 
hanced which might due to a small fraction of mis-classified 
centrals. Nevertheless, for the low-mass bin, at intermediate 
scales of r p =[0.5, 20]/i~ 1 Mpc, results are quite consistent 
between the mock group catalogue and the original SAM 
catalogues, and the dependence of clustering on sSFR is not 
affected by the algorithm used to detect groups and iden- 
tify central and satellite galaxies. Thus, we conclude that 
the finding that low sSFR centrals in the SDSS cluster more 
than their more active counterparts of the same stellar mass 
is real, and not due to spurious effects introduced by the 
group finding algorithm. 

When splitting galaxies by host halo masses, results 
from the SDSS and those from mock catalogues based on 
the SAMs used in our studies are consistent for galaxies less 
massive than 1O 1O ' 77 M0. However, the assembly bias trends 
associated with Mi n f a n shown in the lower-left panels of Fig. 
[2] do not show up in the mock catalogues. The reason is 
already mentioned in previous subsection that in assigning 
halo masses to groups, no additional information like forma- 
tion time is kept. 



4 CONCLUSION AND DISCUSSIONS 

In this paper, we find indications that the strong assembly 
bias of low-mass galaxies in the semi-anal ytic galaxy for- 
matio n models of lDe Lucia fc Blaizotl (|2007l ) and lGuo et alj 
|201ll ) leaves a signature on galaxy clustering as a func- 
tion of halo mass and sSFR. In particular, we find that cen- 
tral galaxies with low sSFR galaxies cluster more than their 
counterparts with the same stellar mass but higher sSFR. 

A similar trend is found in group catalogues based on 
the SDSS DR7. For low mass galaxies, this is likely a sig- 
nature of assembly bias in the real Universe. Alternative 
explanations are possible: for example, the central galaxies 
with lower sSFR could be associated with a 'back-splash' 
population of galaxies that have been satellites in the past. 
These galaxies are found to form earlier t han central galax- 
ies that did not experience such an event (|Wang et al.ll2009| , 
Li et al., in preparation), which may contribute to assembly 
bias. However, the fraction of central backsplash galaxies is 
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Figure 3. The sSFR - galaxy assembly time relation in different stellar mass bins in the two SAMs, for central (blue lines) and satellite 
(red lines) galaxies. Error bars indicate 68 percentile distribution. 
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Figure 4. CFs of central galaxies split by sSFR (upper two rows) and M^lo (lower two rows) in the mock group catalogues of DLB07 
and Guoll (solid/dashed lines), compared with the results of SDSS group catalogue (filled/open circles), in four different stellar mass 
bins. The ratios of CFs of subsamples with lower and higher sSFR (second row) and the ratios of CFs of subsamplcs with higher and 
lower ./Whalo (fourth row) are also presented, where error bars are the relative errors of the CFs for subsamples with higher sSFR and 
lower Mjjaioi respectively. 



lower than 10 per cent, and theref ore does not domin ate the 
assembly bias effect, as shown by I Wang et all (|2009l ). 

In SAMs, low mass central galaxies show a halo mass 
dependence that seems to contradict the halo model pre- 
dictions: here, low mass haloes cluster stronger than high 
mass haloes. The reason is that, for these low mass central 
galaxies, there is a strong correlation between host halo mass 
and halo formation time, with less massive haloes forming 



earlier. Haloes that form early cluster more than haloes of 
similar mass that form later (assembly bias). In addition, the 
clustering amplitude of the parent haloes of low-mass cen- 
tral galaxies varies very little. Therefore, the dependence of 
the clustering amplitude on halo formation time has a larger 
impact than the dependence due to variations of parent halo 
mass. 

More generally, we have found evidence that the clus- 
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tering of central galaxies does not only depend on host halo 
mass, but additionally on secondary parameters like the spe- 
cific star formation rate. While the physical reason for these 
trends remains unclear - it may be solely assembly bias or a 
combination of different effects - our results show that the 
simple picture that galaxy properties depend only on halo 
mass is incomplete. This has to be taken into account in em- 
pirical models like HOD and abundance matching that are 
based on this assumption, and for precision measurements 
of cosmological parameters using clustering studies. 
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